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' AMI n - 4). SJCRH (childhood AMI n - 5). Of the 
Giildren's Cancer Croup {childhood AML, n * 5) 
leukemia banks. The samples wert processed as de- 
scribed (75), with the exception of the samples from 
SJCRH, which used a very different protocol. The 
SJCRH samples were subjected to hypotonic lysis 
(rather than FkoU sedimentation), and RNA was pre- 
pared by an aqueous extraction (Qiagen). 
24. Although the number of genes used had no signifi- 
cant effect on the outcome In this case (median PS 
for cross-validation ranged from 0.81 to 0.68 over a 
range of predictors using 10 to 200 genes. aU with 
0% error), it may matter in other instances. One 
approach is to vary the number of genes used, select 
the number that maximizes the accuracy rate in 
a oss-validation, and then we the resulting model on 
the independent data set In any case, we recom- 
mend using at least 10 genes for two reasons. Class 
predictors using a small number of genes may depend 
too heavily on any one gene and can produce spuri- 
ously high prediction strengths (because a large 
"margin of victory* can occur by chance -due to 
statistical fluctuation resulting from a small number 
of genes). In general, we also considered the 99% 
confidence line in neighborhood analysis to be the 
upper bound for gene selection. 

25. P. A.' Dlnndori et a/.. Med. Pediatr. Oncol 20. 192 
(1992); P. S. Master. 5, J. Ricwds, J. Kendall. B E 
Roberts, C. S. Scott Blut 59. 221 (1989): V. Buccheri 
er at.. Blood 82. B53 (1993). 

:6. M Konopleva er ai. Stood 93. 1668 (1999). 

27. A, W. Crawford and K C. Beckerte.y. Biol. Chem. 266 
5847(1991). 

28. W. Ross. T. Rowe. B. Clisson. J. Valowich, L liu 
Cancer Ret. 44, S8S7 (1984). 

i9. Treatment failure was defined as failure to achieve a 
complete remission after a standard induction regi- 
men including 3 d»ys of anthracycline and 7 days of 
cytarabine. Treatment successes were defined as pa- 
tients in continuous complete remission for a mini- 
mum of 3 years. FAB subdasj M3 patients were 
excluded but samples were o:herwise not selected 
*rith regard to FaB criteria. 

10. |. Borrow er at, Nature tenet. 12. 159 ;i996): T. 
Nakamura. et a/., ibid., p. 154; S. Y. Huang er */. Br J 
Haematol. 96. 68: (1997). - 

1. I Kroon ef a/., £MBO J. 17. 11U (1998). 

2. >. Tamayo er j/., Proc. Nat!. Acad. Sci. U.S.A. 96 
2907(1999). 

3. The SOM was constructed using our GtNECLUSTER 
software (32). with a variation filter excluding genes 
"vith less than fivefold variaiior across the collection 
of samples. 

4. for testing putative clusters denved from the SOM or 
chosen at random, we constricted class predictors 
.vith various number of genes (ranging from 10 to 
•00) and selected the one wfch the highest cross- 
.alidation accuracy rate (in this case, 20 genes). 

3. S related approach would be tc represent each dus- 
:er only as the subset of points lying near the ceh- 
sroid of the cluster. 

5. /arious statistical methods can be used to compare 
ihe predictors derived from the SOM-derived dusters 
with predictors derived from random classes. We 
compared the median prediction strength. Specifical- 
ly. 100 predictors corresponding to random dasses of 
comparable size were constructed, and the median PS 
lor each predict* was determined. The performance 
for the actual predictor was then compared to the 
distribution of these 100 median PSs. to obtain em- 
pirical significance levels. The observed median PS in 
the initial data set was 0.B6. *hich exceeded the 
nedian PS for a .| 100 random predictors; the empir- 

significance level was thus <1%. The observed 
r,edian >S for the independent data set was 061 
which exceed the median PS for all but 4 of the 100 
waTwSus^ Utad0nS: ^ em P lrical iigniftance level 

- Various approaches can be used to test classes C, C 
- <-„ arising from a multinode SOM. One can cor£ 
svuet predictors to distinguish each pair of classes 
(C; versus C,) o- to distinguish each class for the 
complement of ;he class (C, versus not C). Here we 
u^d the pair-wise approach (C- versus C) For 
c-oss-valldatlon one can restrict attention to sam- 



ples known to lie in the union of C, and C. For an 
independent data set. one mu t examine alt sam- 
ples (because it is unknown v hich samples He in 
the union of C, and C,). It t iay be possible to 
improve the statistical power : f this lest by using 
techniques for multlclass predi :tion. 

38. Thirty-three ALL samples were ested by cross-vali- 
dation using a 50-gene predleti r. Thirty-two of 33 
samples were correctly assignee as T-ALL or B-ALL; 
the remaining sample received i PS < 0J. and no 
prediction was therefore made. >euRs are provided 
on our Web site. 

39. T. R. Gdub, unpublished results. 

40. S. Turc-Carel eta/., Cancer Cener Cyiogenat. 19. 361 
(1986); L C Douglass et a/., tbk 45. 148 (1987). 
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Polysaccharides 
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Although rapid sequel 
commonplace, It has n 
amounts of tissue-der 
noglycans (HLCACs) 
assisted laser desorpti» 
for representation of c 
of sequences that are < 
cell growth, and differ 



cing of polynucleotides and polyp 
rt been possible to rapidly sequence 
ved complex polysaccharides. Hep 
vere readily sequenced by a com 
n ionization mass spectrometry an 
olysaccharide sequences. This will . 
ritical to HLCAG biological actlvitie 
mtiation. 



iptides has become 
femto- to picomole 
arm-like glycosami- 
^nation of matrix- 
i a notation system 
•nable identification 
J in anticoagulation, 



The chemical heterogeneity >f polysaccha- 
rides, their structural compiexi y. and the lack 
of effective tools and methods have seriously 
limited the development of a equencing ap- 
proach that is rapid and prac:icaL like that 
used for polynucleotides and polypeptides. 
This limitation is especially :elevam in the 
study of elycosaminogiycan KiAG) complex 
polysaccharides, which are prt sent ai the cell 
surface and in the e.xtracellula matrix ( ;. :). 
Heparin or heparan sulfate-Iikt n|ycosamino- 
glycans (HLGAGs), a subset of G AGs. are 
currently used clinically as ; nticoaizulants. 
and this function of HLGAG: has been as- 
signed to a specific pentasacch .ride sequence 
that is responsible for binding t ■ antithrombin 
III i S). Recent progvss in dev -lopmental bi- 
ology, genetics, and other fielt s has resulted 
in a virtual explosion in the discovery of 
important roles for HLGAGs in the biological 
activity of morphogens (4) (for example. Wing- 
less. Decapentaplegic, and Hedgehog); growth . 
factors, cytokines, and. chemojiines (5); en- 
zymes (/, 6); and surface proteins of micro- 
organisms (7). Although, it is increasingly rec. 
ognized that a specific «quence,;typically from 
a tetra^ to a decasaccharide in.siiie; is ; resj>oiisi- 
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ble for HLGAGs" nodulation of biological ac- 
tivity, in only a fe\ ■ caies is there any structural 
information rcuar ling sequences (A 1 ;. There- 
fore, accelerating >ur understanding of struc- 
ture-function reJa ionships for HLGAGs re- 
quires the development or rapid yet thorough 
sequencing mcrhoJolocies. 

There are mai y issues that have limited 
the development of sequencing techniques 
for HLGAGs. HLGAGs are chemically com- 
ple\ and hetcrogei eous. because the HLGAG 
chain can vary ii terms of the number of 
disaccharide repea anils and posicsses. with- 
in the disaccharidi repeat unh, four potential 
sites for chemica modification. The basic 
disaccharide repc .t unit of HLGAG is a 
uronic acid [ct-L- duronic acid (I) or g-D- 
glucuronic acid (C i) linked 1,4 to Q-D-hcx- 
osamine (H) (Fig 1A). Touether, the four 
different modiflcai ions (2** = 16) for an I or 
G uronic acid isomer containing disaccharide 
give rise to 16 x : [ = 32 different plausible 
disaccharide units .for HLGAGs. In contrast, ■ ' 
four bases make up: DNA, and 20 amino acids 
make up proteins,: With these 32 building . 
blocks, an octasacf : :haride could have over a 
million possible sequences, thereby making , . 
HLGAGs not only the most acidic but also 
the most informitiori-dense biopolymers 
found in nature. TT;ere are no methods avail-'" . 
able to amplify or j^oduce HLGAGs in large - 
amounts, unlike the techniques that are avail- ■ . 
able for DNA or proteins. 

To handle the enormous information den- . . 
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sity of HLGAGs.. we have developed a hexa- 
decimal notation system (base 16) to repre- 
sent the four potential sites for chemical mod- 
ification in the disaccharide repeat unit (9). 
The functional groups modified by sulfates 
can be represented by a binary digit (0 for no 
sulfate, and I for the presence of a sulfate) 
Thus, each possible HLGAG disaccharide 
can be assigned a one-letter code based on its 
hexadecimal value (Fig. I B) (10). The isomer 
of the uronic acid can be assigned a positive 
(I) or a negative (G) sign with the same 
one-letter code. With such a coding scheme 
mathematical operations (addition, subtrac- 
tion, and so on) can be used to encode infor- 
mation such as susceptibility to enzymatic or 
chemic al cleavage of specific sites and chem- 
ical properties that could facilitate sequence 
comparison or alignment (P). Furthermore, 
this code can be easily expanded in the future 
to add other modifications that mieht be ob- 
served, through the addition of binary digits 
to expand to a higher numerical base! 

Different combinations of the 32 buildin* 
blocks yield tetnx hexo-. or longer saccha- 
rides, and it is possible to generate a list of 
theoretical molecular masses of all possible 
saccharide sequences as well as to uniquelv 
assign :he length of the saccharide based on 
the molecular weight (Fig. \Cl For instance, 
the minimum difference between any disac- 
charide and any tctrasaccharide is 101 dal- 
tons. Further, within an oligosaccharide larg- 
er than a disaccharide. there is a minimum 
difference in mass of 4.02 dahons that is ac- 
counted for by two acetate groups (84.0S dal- 
tons) replacing a sulfate group Ifc0,06 dahons). 



HE P O RTS 

Therefore, the length and the number of sul- 
fates and acetates ar : readily assigned for a 

: up to a tetradecasac- 
cnande. The matrix- listed laser desorption 
ionization mass spe. trometry (MALDI-MS) 
technique described rare enables the determi- 
nation of the mass of HLGAG complex oligo- 
saccharides (from di- o decasaccharides) to an 
accuracy of < I daltor , with a sensitivity down 
to 100 frnol of materi, .1 (7/). The combination 
of the notation syste n and the accuracy of 
MALDI-MS enabled us to determine mass, 
identity relationslups. 

ut H 1 !?: ° Vera]l :trate S>' for syncing 
HLGAGs essentially involves the reduction 

■n size of the startirg oligosaccharide into 
smaller fragments through the successive use 
of experimental const -aims such as chemical 
and enzymatic degrad ition, as well as the use 
of MALDI-MS to det. rmine the length of the 
saccharide and the nimber of sulfates and 
acetates in it. The donation of sequences 
that do not satisfy he experimental con- 
straints rapidly enab es convergence to a 
umque sequence afic successive iterations 
Speafic examples o: how one would se- 
quence HLGAGs witl ihis process arc illus- 
trated below. 

Example J; MALE (-MS of the saccharide 
pomts to a mass of !230.: daltons. corre- 
sponding to an octas;ccharide with II sul- 
fates and no acetates Fig. 2A). Analysis of 
this saccharide by ca miliary electrophoresis 
iCE) shows the prcserce of a trisulfated dis- 
accharide QU 2S H VS 0 , ; >D) and a disulfated 
disaccharide aL'H ss , s : =5) with relative 
abundance of 3.1 (Fit. 2A, inset). Because 



repeating unit. (A) Di- 
saeehande repeating unit containing a uronic acid 
ja-i-iduronate (I) or p-D-glucuronate (C)l with a 
1-4 linkage to o-D hexosamine (H). The functional 
groups can be modified (primarily sulfation) at the 

M"M^t {X h and N * ulfation w Ketyiation 
[i). \S) The hexadecimal notation system repre- 
senting the U-H disaccharide units of HLGAGs Col- 
umns 1 through 4 represent binary digits that indi- 
cate the presence (1) or absence (0) of sulfates at 
we Z. 6. 3. and N positions of the disaccharide 
^'* A ,° * the N position represents a 
N-acetylated glucosamine. Column 5 shows the 
hexadecimal letter code that is encoded by the 
binary digits of columns 1 through 4. The hexadec- 
imal units are assigned a sign based on the isomeric . 
state of the uronic acid; positive (+) hexadecimal 
'S repr " en * -staining units and negative (-) 
hexadecimal digits represent G-containing units. 
The conventional chemical nomenclature is shown 
in column 6. The calculated molecular weight of the 

,W""? e U "! t! (* if the y vvere internal in a 
chain) , s shown in column 7. Free amine-conUining. • 

fo^tV W -, fe 001 USed in tnis b «t thif V. . 

teS£?u e3s,l y . extended to include »«* modi- 

or e£CL,' re ■"J? fl 8 s - 2 <• for chemical-, ^ 

me ?SZ C modifi , ati <™ t° these disaccharides. 



both the I- and G-containing octasnccharides 
need to be considered (J2) f the number of 
possible combinations of sequences having 
these dis iccharide units is 32 (Fig. 2D, part if 
If heparnase I digestion data (Fig. 2B) are 
"sea 1 as a constraint, only 4 out of the 32 
sequences can result in fragments that corre- 
spond to these observed masses (Fig. 2D, pan 
u). Thus the heparinase I data fix two di- 
saccharid : units at the nonreducing end. How- 
ever, for i ie reducing end, having +5 (1H NS hS ) 
or 5 W H N s,fls) ^ the terminus or having +5 
or -5 at he second position from the terminus 
are possMle (Fig. 2D, part ii). To converge 
fcrther on the octasaccharide sequence, wc used 

u , 0f in Fig. 2D, part ii 

the only s -quence that can satisfy the observed 
masses o the products of heparinase III di- 
gestion ,: ±DDD-5, which corresponds to 

Examj le 2; Here we illustrate the use of 
MALDI-MS and exoenzymes to sequence a 
nitrous acid-derived hexasaccharide. Partial 
digestion 3f the hexasaccharide with nitrojs 
acid result in a ladder of hexa-. tetra-, and 
disaccharides (Fig. 3A). The nonreducing end 
of all the saccharides in the ladder can be 
sequencer simultaneously with exoenzymes 
(U). Upoi treatment of the nitrous derive-' 
products \ ith the exoen2>Tnes (iduronate 2-0 
suifatase, duronidase. and glucosamine 6-0 
sulfatase). the hexasaccharide and tetrasac- 
charidc peaks were correspondingly reduced 
in mass, inflecting the removal of 2-O-sul- 
fate, idurciate. and 6-0 sulfate from the re- 
ducing en I of the products (Fig. 3B). Com- 
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^itional analysis by CE (by means of he- 
rinase I) indicated that disaccharide present 

the reducing end was L 5 Man 6S (+D'). 
Aether this information allows us to con- 
r ge to the sequence +DDD' (1 2S H ns.6s 

H 6S U s Man tfS ). 

* Example 3; MALDI-MS of a basic fibro- 
jst growth factor (FGF-2) binding saccha- 
le indicated a species with admass of 2808 . 
Irons, which corresponds to a tetradeca sui- 
ted decasaccharide with anhydromanitol at 
e reducing end of the saccharide (Fig. 4A, 
set). Compositional analysis of this sample 
suited in two peaks corresponding to ±D 
iU 2S H NS6<; ) and zU (AU 2S Man 6S ) in the 
tio 3 : t. As this decasaccharide was derived 
/ nitrous acid degradation of heparin, the 
*onic acid at the nonreducing end was not 
pserved by CE (232 nm). Therefore, the 
Dn-educing end residue was identified 
trough sequencing with coenzymes to be 

-D (Us h ns.6s)- 1,16 number of P oss . iblc se " 
uences with this composition is 16 (fig, 4B, 
art i). Of the 16 sequences, those that could 
•suit in the observed fragments upon hepa- 
inase I digestion of the decasaccharide (Fig. 



REPORTS 

4A) are shown in Fig. 4B f | art ii. To resolve 
the isomeric state of the inisrnal uronic acid 
+D versus -D, exhaustive digestion of the 
saccharide with heparinase I and heparinase 
III was performed Heparit ase 1 exhaustive 
digestion of the saccharide resulted in only 
two species corresponding to a trisulfated 
disaccharide (±D) and its anhydromannitol 
derivative, whereas heparhase III did not 
cleave the decasaccharide it all. The above 
facts taken together confirm that the sequence 
of the FGF-2 binding decasaccharide was 

DDDDD' [(l 2S H NS 6S ) 4 U5 N an 6sl 

Example 4: This is an example of the 
determination of a complex sequence. Com- 
positional analysis of an A Mil binding sac- 
charide indicated the preser ce of three build- 
ing blocks, corresponding to AU 2S H Ns.t,s 

.3S.6S 

( ± 7) in the relative ratio of! :I:I. respectively 
The shortest oligosacchai.de that can be 
formed with this compositi )n corresponds to 
a decasaccharide. which is \ onsistent with the 
MALDI-MS data. The tots I number of pos- 
sible combinations of this tridccasulfated. 
singly acctylated, decasac ;haride sequence 



with the above disaccharide building blocks 
is 320 (Fig. 51), part i). Digestion of this 
decasaccharide with heparinase I resulted in 
four fragments ■ Fig. 5 A). Of the 320 possible 
sequences, onlj 52 sequences satisfied hepa- 
rinase I digestit n data (Fig. 5D, part ii). The 
mass spectrum >f the exhaustive digestion of 
the decasacchat de with heparinase I showed 
mass-to-charge ratio (m/z) values mat corre- 
sponded to a tr sulfated disaccharide and an 
' octasulfated he casaccharide, thereby further 
reducing the Ii t of 52 sequences to 28 se- 
quences (Fig. ; D, pan ii). To further con- 
verge on the se juence, we used a "mass tag** 
at the reducing end of the saccharide, which 
enabled the id< ntification of the saccharide 
sequence close to and at the reducing end 
(//). Treatmen of the semicarbazide-tagged 
decasaccharide A*ith heparinase li resulted in 
traemems with m/z values of 4805.0. 5264.6. 
5320.9, 5381.7, 58917. and 5958.* (Tig. 5B). 
The m/z value* of 5320.9 and 5897.7 corre- 
sponded to a u gged tetrasulfated tetrasacchn- 
ride and to a ta ;ged heptasulfated hexasaccha- 
ride. both conta ning the .V-acetyl glucosamine 
residue. This pi ices +4 or -4 (LGH NAC ftS ) at 
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Fig. 2. Sequencing of octasaccharide. (A) A 10 jiM . solution of an 
octasaccharide sample was analyzed by MALDI-MS..The.noncovalent 
complex of the peptide and the saccharide [11) was observed with. a 
measured m/z value of 6458.2. Shown in parenthesis Is the mass. of .the 
saccharide after subtraction of the peptide mass. - SO a indicates the loss . 
of a labile sulfate of the octasaccharide substrate. Under relatively high- 
laser intensities, larger oligosaccharides (octa in length or longer) can 
lose a sulfate. This undersulfated species is -not a contaminant, as verified 
by CE. (Inset) Compositional analysis of the octasaccharide- was. com- 
pleted by exhaustive digestion with heparinases I. II,. and HI and by 
analysis of the disaccharides with CE as described (H). Naphthalene 
trisulfonlc acid (NTS; 1 p-M) was run as an internal standard Assignments 
of AU 2S -H NS ^ 6S and AU-4 N56S w« re mdde on the basis of the fact that 
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. ^ Heparinase I digest 



Sea Fragments formed 
_ ,.(5?7j (577) (1074) 
±DQD5 ••••■£> *D ±05 
*D0t>5 iD ID . iD* 
itfED; *D ID tD5 
ifl>5D ±D' 4D. *T>5 



(ii) 



rtiparinase W digest . (jji) 
!5e1j Frao^nents formed 



trWy cbmierated with known standards.: Peak areas are shown in pareh- 
tr^eses. (b| I Digestion with heparinase 1 :78) resulted In a, pentasu fated 
tevasaccharide. Of m/z 5300.1 and a itrlsulfated disaccharide of m/z 
4002.6. A KSO. adduct was also observiid for the tfsacchande as a result 

.of matrix preparation conditions. (C) digestion of octasacchande i with, 
heparinase 111 (78) yielded a nine-sulfated hexasacchande. m/z 595S.'. 
(P) shows the convergence of the octrsaccharid* ^« °^ a » c ^ ha f n ^ 

"contains a A4.5 uronic addat the nonrerfudng end. (IM^o^ 
possible Octasaccharide sequences that cian be obtained for three ano 
c%e ±5. The mass of the fragments resulting from digestion of octasac- 

' d^aride with heparinase I Is shown in (ii): : Also shown m (m) are sequences 
fYbm (i) that can satisfy the heparinase ;l data. (Hi) Sequence of octasac- 

:-dVaride from (ii) that satisfies the hepaiinase III digestion data IC). 
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Reports 

the reducing end or one unit from the reducinc Pi*J a j -a . 

end, thereby limiting the numberTf Ste 5 L ZZ^J **** * 

sequences from 28 to 6 (Fig. 5D pan "fift ™ Ll T ^ dlxa ^^<k provided 

v >g. 3u, part m). no addihonal constrai its but confirmed the he- 
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Fig. 3. Sequencing of 
hexasaccharide. (A) Ni- 
trous add treatment (20 
min) of hexasaccharide 
(7fi) resulted In a ladder 
containing the starting 
material (with Manjf 
(m/z 586*5), a penta- 
sulfated tetrasaccharide 
[with ManJ S304.1), 
and disulfated disacdiaride 
(m/2 4726.2) [with Man J. 
R indicates ring contrac- 
tion (ard a mass difference 
of 97 daltons) from a N- 
sulfoted hexosamlne to 
anhydromannose (/, 2). 
Shown in parentheses is 
the mass of the saccharide 
after subtraction of the 
peptide mass. (B) The ni- 
trous arid-treated hexasac- 
charide was subjected to 
iduronate 2-0 suifatase, 
iduronidase, and glu- 
cosamine 6-0 suifatase 
(72). The starting material 
(m/2 5A84.6) and a pen- 
tasaccharide (P) (m/z 
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fig. 4. Sequencing 
FGF-2 binding deca- 
saccharide. (A) (inset) 
MALDI-MS was per- 
formed to determine 
the mass and size of 
the saccharide as a 
complex with FGF-2 
(191 Dimers of FCF-2 
bound to the saccha- 
ride (S) yielding a spe 
des with a m/z 
37,009. By subtraction 
of FCF-2 molecular 
weight, the molecular 
mass of the saccharide 
was determined to be 
2808, corresponding to 
a deca saccharide with 
14 sulfates and an an- 
hydromannitol at tie 
reducing end. (A) He- 
parinase I digestion of 
the decasaccharide yield- 
ed a pentasulfated tet- 
rasaccharide (m/i 52863) 
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Possible sequences 

i. +ODCDD 9t 40ix>pa ;■: 

2 +CODCXJ 10.-KK30MT 

3. 40DO00 11.-KX3MD. 

4. +OODOO 12+DTOW 
5 «CMDD M MyOtCb 
6. +OMD.D 14. 

7 +OOD-OD 15.4O0MC7 
[6 W&D 



.Seq^^,. : .:^rnent5 formed ■ .. 
^^<^^W7) (1069) 



Tin 

Sequence ; i £r»rneits formed 
4hr^"' : ( 



e nitrous acid dices. 
Hon, however, gave only the reducing end tet- 
rasacchar de (with and without the tag) as an 
unchppet product (Fig. 5Cj. This confirmed 
+ « c r ~ 4 ^H^) was one unit away 
from the : educing end. Sequential use of exoen- 
zymes ut iquely resolved the isomeric state of 
the uroni: acid as +4 and the reducing end 
disacchar de as -7. which is consistent with 
4-7 beini the key AT-IU binding motif (J) 
nws. we deduced the sequence of the AT-ljj 
binding decasaccharide as ±DDD4 7 

The s jquencing approach can be readily 
extended to other complex polysaccharides 
oy developing appropriate experimental con- 
straints. For example,, the deimatan/chon- 
droitin rrucopolysaccharides (DCMPs). con- 
sisting cf a disaccharidc repeat unit are 
amenable to a hexadecimal coding svstcm 
and MA .DI-MS (/J). As is observed for 
HLGAGj , in DCMPs a given mass has a 
unique signature associated with length and 
composn on. For instance, the minimum dif- 
ference I enveen any disacciiaride and anv 
tetrasaccl aride is 139.2 daltons; therefore, 
the lengn and the number of sulfates ard 
acetates . an be readily assigned for a givJn 
DCM oligosaccharide up t0 an octadecasa;- 
charidc. iimibrly. in the case of polysiaiic 
acids {Pf As), which are present mostlv as 
homopoI;-mers of 5-.V-aceiyineumminic acid 
(NAN j or : -A-ylvcolylneuraminic acid (NGN"i t 
the he.xai ecimal coding system can be easily 
extended ro NAN. NGN to encode the varia- 
tions in he functional groups and enable a 
sequencirjj approach for PSAs [16). 

The examples outlined here represent a 
practical md rapid sequencing methodology 
for HLG \Gs in particular and for complex 
poiysacc! arides in general, so that it is pos- 
sible to arive at a single solution regardless 
of modifi ;ations of ^polysaccharide chain. 
The advantages cf the sequencing approach 
arc three bid. First, through examination of 
all possible sequences for a oligosaccharide 
chain, it is possible to design. .rurther experi- . ... 
mental, cc nstraints.that will result m convef- '. : ' 
gence to: a single . sequence - in- the fewest .. ; 
... number oT steps, thus making diir sequehciitg. : 
approach: rapid. The entire analysis .iii. each .. 
. example .jsed only ]• pmol- of. material and ... 
took less than a day of experiments" to arrive 
at the fin d sequencei as. compared with the 
nucrograj^-to-rninigram . amounts" . required 
for time-iitensive methods (/i). Second, be* 
cause mi:; apnroach starts frAm »n ^c«*ki*'*:1 



cause mi:;, approach starts from all. possible' 
sequencer and eiiminates , those that do nori 
conform ;:o the experimental constraints, it 5 ! 

was assigned to be Wol£ne7£Z^™!!^ . . ensures tliat unusual sequences'ire not elinvjf 

^Tl^softrJfr^r^^ 

*° *own in («) are & rhe sequences E.l n^^^ Ily not ^.aBow for portability of the ap 1 

decasaccharide tz\ ^JtS^a^.W.™ satis^.heparlnasft I. diges^on' dati m ^u^J 1 ^ . nrnarkH,.: Lu ^ a„J„"L™ T„1 
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Abo shown in ^ areX ^nceT^ not 

^sacoharide frim (ii) thaS^t^ V P^hbu: will also facOiW^opment ofa ] 

s« .^ [jepar^ • my moimcJ S c qucncing raet hodology (J 7)!$ 
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~rtS?4*J!k&1f binding de «««haride. (A) Heparinase I 
est of he decattohar.de yielded four fragments. The major frag 

7\ t tS^SEg?*?' 5ing '? aC6t y , " ed saccharide (m/z 
9.7). a heptasutfated. i.ngly acetylated hexasacchar.de Wr S842 1) 

asac haLX? , TC' S l PreSent J S a contam ™"t (•). a pentasulfated 
\ Z ' Jl 4i The * CM «a»wWe was first modified at the reduc- 

,!ph ?m V ro<J " C f 3 ^ tag ' A ro/z of S 6 - 1 « *own as • in (B) 
& J ,J - ^ p ' Cal y,eld5 for ,ne mass - ta 8 ^beting vary between 80 
90% as detennlned by C£. (B) Heparinase II d gestion of tr* 

%£SS& m Z c \ B l'J, {la ^ e<S he Ptasulfated. singly acetylated 
isacchande). m/z S381.7 (hexasulfated tetrasaccharid!) m/z 5320 9 
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' Sequence 


Fragme nis formed and their mass 
577 1)37 1731 1093 1G70 
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±0 ±|4 ±DDD 
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(ngged tetrasaulfated tetrasaccharide) 
re: accharide), and m/2 4805.0 (a trisuifa't 
nnrous acid treatment of decasaccharid. 
fated, singly acetylated, anhydromannltc 
na :ed T (one tagged m/2 5241.5 end 
mc icates ring contraction. As shown h 
tetrasaccharide with iduronidase (and r 
sp* cies of m/z 5007.8. corresponding 
re! due. Further treatment with exoenzyi 
[g\ jcosam.ne 6-0 sulfatase. hexosamida: 
thi complete digestion of the trisaccharid 
birding decasacchartfe sequence from 3 
28 to 6 to the final sequence. 



m/z 5264.6 (tetrasulfated tet~ 
?d disaccharide). (C) Exhaustive 
essentially gives one tetrasul- 
i tetrasaccharide species design 
one untagged m/z 5 186.5). R 
■ the inset, treatment of this 
ot glucuronidase) results in a 
to the removal of iduronate 
nes in the following order only 
e. and glucuronidase) results in 
2. (D) Convergence of the AT-lll 
10 possible sequences to 52 to 
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10. Here, the diwccharioe building Woclcs ol HLCACs are 
first represented with the uro ttc acid st the nonre- 
ducing end and the hwosamir * at the -educing end 
(U-H). It is abo ,-ossiblt to hav» H at the nonreduring 
*"d to define the repeating disaccharides in the 
HLCAC chain (H~U), although this wiH not concep. 
tualty change the analysis 

.n. T|»;.rnethodi3logy used. for_tfte,analysis.of acidic 
poiysacdwrid*:^ 

cWerasrton^ 

™ ^:p^d$a^ 
Hiass.spec»^r^ 

5?f?e^ J a^^^^^ 

S£ ^ ? 333 i 0»*k ; ^ J. Romberg .et af.. 

A ^..Rhom%: et ai; aW.- jfcijHgjfl;-:-.. . . . \ • . • . 
12. ^a^t with:hep>i^ 

w^l 0 " •^f^^> j iS^-ft:tHa-tot« of.'.' 
^ycoslo^vu^u^^^ 

jwily . G . coritaimiig. linkages. However ■ 

HLCACs. 

whereas : heparinase M ; can • dJ^. at: l; in . unsulfated 



13; 



14: 



■ • * n'" 
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.regions of HMACs. Using defined substrates we 
defined the de »v 3 ge conditions for heparinase l and 
III under our rei :tion conditions. We found that there 
were certain v ell-defined parameters under which 
heparinase I ar d III dip at their respective primary 
sites and not at the>r secondary sites (supplementary 
data m avaiUsle at Szierce Online at www 
sciencemag.cr & :featuni/data/1042118jhl). In part 
these, studies oefined' tfie designated . "sKort'vand 

^wtlve"remi6ncondKionsotrtliriedbe^^ 
,!nejchcase,tjOTughthei9ecrfDursev^ " 
and Independen; experimental; constraints (that fe uv'-' • 
. complete nitrou; add.fclJowed by ewen^^ 
haustivc 1 nitrous: add and con^monal analysis) we 
can confirm the: sequence assignment obtained with " 
heparfnases as i xperime^iaJ constraint ■ 
Otfw-mathbds Induding the use of txiiaj^i^i^.-.'..' 
being developec for sequericing saccharides, arid for ' • 
• the most part require sampte isolation and repurffi- ' 
ataon [ j- H. Ti;mb«U- ei al, 7>roc. /Vatf. AcU Sd. <' 
..ttlifc: 96/270:; (1999j/.and. references. therein; R, 
Vlves et 9l:. BiovhemJ. 339, 767 (1999JJ. However, 
. tne^MALDl-MS approach, for sequendng. saccharides - 
With exoenzyrrif s does not require repuriflcatfon of • 
the saccharide after manipulations. The MALJDl mass 
spectrogram is ;ible to generate all the sequence ' 
mfonnation 6f i\[ the saccharide fragments. In a ■ 
ladder form in: one spectrum. 
The AT-Iir bindirj decasaaharlde was a gift from R. 
Lmhardt The"se quehce of: this decasaccharide has 
been reported ti: be- D4-7iJo. an the basis of NMR ' 
spectroscopy |Y.: Toida et aL. J. BioL Oiem. 271, • 



1999. 
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32040 (1996)). Such a sequence jhould show the 
appearance of a lagged D or OD residue at the 
reducing end. However, we find aU the different 
experiments used in the elucidation of the decasac- 
chartde sequence to be consistent with each other in 
the appearance of a 4-7 tagged product and not a 0 
(or a DD) product This saccharide does not contain 
an Intact AT- II J binding site, as proposed Therefore 
w* sought confirmation of our proposed sequence 
through the use of integral glycan sequencing (ICS) 
which agreed with our analysis. 
IS. OCMPs are found in dense connective tissues such as 
bone and cartilage. The bask repeat unit of DCMP 
can be represented as -(3 1->4) l) Zx -( Q /p t^j 
G«Uc43cm-. where U is uronic acid. Gal^ is a 
rv-acetylared galactosemia, and there are 16 disac- 
charide building Wocks for DCMP. Like the hepari- 
nases that degrade HLGAGs, there are distinct chon- 
doroitinasM and other chemical methods available 
that dip at specific giycosidic linkages of DCMP and 
serve as experimental constraints. Furthermore, be- 
cause DCMPs are acidic polysaccharides, the MALDl- 
MS techniques and methods used for HLGAGs can be 
readily extended to the DCMPs. 
16. The monomelic units of NAN and NGN are linked by 
a 2-8 giycosidic linkages and can be modified at the 
.4-0. 7-0, and 9-0 positions. Methods of purifying 
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and character^ ,g ps A oligosaccharides using higiv 

performance liqx id chromatography. CE. and MS have 
very recently be m established, m addition, chemical 
and omsWydas, s and enofowtydasM that cleave at 
distinct glycoside linkage of PSA are available and 
can serve as exp mmental constraints. 
1 7. The sequencing * ^proach presently uses a brute force 
method because many of the rules regarding the 
spectfrcity of enz mes thai degrade anC modify com. 
P'ex polysacchar, Jes are ir. the developmental staee 
Once these rules or constraints are fully developld! 
more inteHlgent slgorithrr., such as a genetic atao- 
rithm or Monte < arto optimitation could be used to 
search a much ru -rower starch space for the correct 
sequence. The o mbfnatkm of more efficient con- 
strain* and aigo ith/ro will thus lead to a fully au- 
tomated sequent ng approach. 
18. Th e use of hepari, ases was essentially as described in 
references rn<7/) Digests were designated as either 
£T*JU *i? au5t ' /e - digests were completed 
with 50 nM eray ne for 10 min. Exhaustive digests 
were completed *h 200 nM enzyme for either 4 
hours or ovemigh . Partial nitrous acid deavage was 
completed using . modification of published proce- 
auret Briefly, to ,n aqueojj solution of saccharide 
was added a 2x 3 Mutton o- sodium nitrite in HC1 so 
that the concentre :ion of nitrous acid was 2 mM and 
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t let of HQ was 20 mM. The reaction was allowed 
r. •oceed at room temperature with quenching cf 

M of 200 mM sodium acetate and of bovine sen 
a bum* {i mg/mt pH 6.0). Exhaustive rfeou a 

n Mnftous add in HO overnight at room tenperan! 
It both cases, « was found that the prtxfucts ofnfot 

^^ g ^ ddbes ^^ y byMA10lwn 
o tether cleanup and without the need to reduce t 

dased I from Oxford Qycosystems (wtkeTMrf, m 
at d used as suggested by the manufacturer 

19 ' m ]u5oS? ef a/ " froc ' NitL Aead - fci U5 - 
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Stability and Variability in 
Competitive Communities 

A. R, Ives,* K. Cross, J. L Klug 

S^ t V nl ability ^ the , « UndanCe ° f W ulations de P- * on the sensitrvty 
otspeges to environmental fluctuations and the amDlifieVti. n of *1 5ens,tJV ^ 
fluctuations by interaction^m™**^ • a,.l ^P 1 '" 0111 n of environmental 

species n^S^^SS^t^ * " d 
^ teve,. a coring JS^'^S^^J^ 
no effect on variability measured At tha ™~ -1 i . * aors ha "* 

may increase «mw£*£^ 7,1 !re, °^ ^ersity 

responses to envVonmema! «uSL5KJ^^ ** 
of competitive interactions has little effea * 

» 



The stabd.ty or an ecological community is- 
thought to depend on the number of species it 
contauis an d the strengths of interactions be- 
tween them {/). Stabilirv- also depends on the 
organizational level at which it is measured 
In model ecosystems. May l2) jhowed ^ 
increasing the strength and number of species 
interactions decreases the stability of individ- 
ual spec.es' dynamics. In contrast, experi- 
mental studies (J) showed that large commu- 
tes of plan, competitors may be more sta- ' 
ole than small communities when stabilirv is - 
measured in terms of the total biomass of ail 
species. 

The stability of ecological communities 
'ommonly characterized bv one of five 
Properties; mathematical stability, resil. 
»ence, resistance, persistence of" species 
and variability (4). Of these, variability is''-- 
the most frequently measured, yet least un- 

naoison. Madison, Wl S3 706. USA. 



derstood from a theore-.ical perspective. 
Our analyses addiess the effects of inter- 
specific competition and species civersilv 
on variability in tl e biomass of individual 
spec.es and variab liry in total community 
biomass. Environn ental fluctuations cause 
s™*^ (>ear : tt--year), changes, in spe- 
cies.:bioma S5 e^>S^ies; ihte ra ctid^ then : 

eny^n^efltal^n hlta - 

by 'the 

strength , of species imteracfibns . and we ^^: : 
number of ipecics^aicdmmum^. We use 

: ^««Wc-ars^ 

*<(*+ 



'J F[ ] is a (unction giving .ho chance ir 
b.omcss between years; r and A." are th £ 
species-specific intrinsic rate of increase and 
canyi ig capacity, respectively; and a is the 
companion coefficieni measurina .he etT-, 
of spe tics; on species ;. The modd is par»^. 
etenzed so that, in the absence of environ- 
ments variability, when r > |. p„ pilia , ions 
show , % ereompensatmg dynamics, and when 
r, < 1 biomasses tend to approach eqailib- 
num rionotonicaMy. Environmental fluctua- 
tions a-e included with species-specific ran- 
dom v iriables tfti To account for similari- 
ties an cng species in their responses to e-- 
vironm :ntal fluctuations, we assume th^« 

vilues are correlated with correbi-cn 
coetTic cm p. 

\Ve measure variability at two levels 
For the species level, we use the sum of ihe 
variant ei of the biomasses of constituent 
spec.es V; = S;.,r[.r,(/ ( j. where n . is the 
numbei of species in ihe community For 
the community level, we consider the vari- 
. ance oi the aggi-egate bibni^ in the com- • 

variability m this. way relates bur results 
duwtjy to experimental studies emplovioe" 
vananc :s {3-5); 

• ; • : ^al;Kticai . approaches provide . genial ' ~ 
conclusions that ate independent ioWetallea •■• 
modeVtssumptwns. - AlttougK:':-vanaiices- ut 
species liiomasses produced by;Eq, l depend -. 
on nonlrneanries in F{ ]; variances ■ can be 

sion fho<lel (d) . 



ml^rr ,Spmd!Wt Jh0uid be 'ddressed. E- 
man imtt@txs Uf i .wise .edu 
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sion rhCNlel (d) 

where X-7) is a vector of jlpecies biomasses*^ 
Ar^/) rel itive to ihe mean [te.(t) = X( p) - g"^ 
and E(/):is a.Vwtor:d 
contains Oie environment 
m species biomasses from one /year , to ibc-- . 



